Mining Association Rules Based Approach to Word Sense Disambiguation for Hindi Language
نویسندگان
چکیده
These days, the language is making hindrances in the advantages of Information Technology revolution in India. So, there is the need of the adequate measures to perform natural language processing (NLP) through computer processing so that computer based system can be interacted by users through natural language like Hindi. This paper presents a new Word Sense Disambiguation method associated with mining association rules. This method can mine the association rules between the sense of the ambiguous word and its context using its various senses present in the Hindi WordNet. The sense of the ambiguous word assigned by choosing the sense which the most association rules refer. In this paper, we are concerned with association rule mining for word sense disambiguation for Hindi language and finding the correct sense for given Hindi word. We explore the multiple meanings of Hindi word with the help of Hindi Word net prepared by IIT Bombay. The experiment result shows that the method has high precision. Keywords--Word Sense Disambiguation, WSD Approaches, Hindi WordNet, Mining assosiation rules. I. WORD SENSE DISAMBIGUATION In natural language processing, word sense disambiguation (WSD) is the problem of determining which "sense" (meaning) of a word is activated by the use of the word in a particular context, a process which appears to be largely unconscious in people. WSD is to make the computer automatically choose the correct meaning of each word, and it is still the biggest problem at vocabulary level in NLP. Words can have different senses. Some words have multiple meanings called Polysemy.[1] Word sense disambiguation (WSD) is the problem of determining in which sense a word having a number of distinct senses is used in a given sentence. Take example, Ambiguity for Humans and Computers: Ambiguity for Humans e.g. भहिरा एक छाता के साथ आदभी को भाया. (भहिरा एक छाता उऩमोग कयके आदभी को भाया मा वि एक आदभी जो एक छाता रे जा यिा िै उसे भाया) Ambiguity for computer in their day to day communication e.g.[2] आभ आभ आदभी की ऩरयधध से फािय िै (मिॉ आभ के दो भतरफ िै् पर औय साभान्म आदभी)
منابع مشابه
Word Sense Disambiguation in Hindi Language Using Hyperspace Analogue to Language and Fuzzy C-Means Clustering
The problem of Word Sense Disambiguation (WSD) can be defined as the task of assigning the most appropriate sense to the polysemous word within a given context. Many supervised, unsupervised and semi-supervised approaches have been devised to deal with this problem, particularly, for the English language. However, this is not the case for Hindi language, where not much work has been done. In th...
متن کاملWord Sense Disambiguation in Bengali applied to Bengali-Hindi Machine Translation
We have developed a word sense disambiguation(WSD) system for Bengali language and applied the system to get correct lexical choice in Bengali-Hindi machine translation. We are not aware of any existing system for Bengali WSD. Since there is no sense annotated Bengali corpus or sufficient amount of parallel corpus for Bengali-Hindi language pair, we had to use an unsupervised approach. We use a...
متن کاملDesign and Implementation of Association Rules Based System for Evaluating WSD
In this paper we presentnew method that usedassociation rules mining techniques in the field of natural language processing. Word sense disambiguation is persistently a central and challenging problem as increasing usage of internet in daily life. Every user has some queries that have to be searched on the internet. Transactional Database is created after preprocessing the text files. Ambiguous...
متن کاملA Highest Sense Count Based Method for Disambiguation of Web Queries for Hindi Language Web Information Retrieval
The ambiguity in word senses has been recognized as a major challenge for the information retrieval systems. Hindi language web information retrieval, like other languages, faces the problem of sense ambiguity. The sense ambiguity problem deteriorates the performance of every natural language processing (NLP) application. The performance of Hindi language web information retrieval is also affec...
متن کاملMining Class Association Rules for Word Sense Disambiguation
In this paper we propose an approach to the task of Word Sense Disambiguation problem that uses Class Association Rules to create an e ective and human-understandable rule-based classi er. We present the accuracy of classi cation of selected polysemous words on an evaluation corpus using the proposed method and compare it to other known approaches. We discuss the advantages and weaknesses of a ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013